25 research outputs found

    TIGER : The gene expression regulatory variation landscape of human pancreatic islets

    Get PDF
    Genome-wide association studies (GWASs) identified hundreds of signals associated with type 2 diabetes (T2D). To gain insight into their underlying molecular mechanisms, we have created the translational human pancreatic islet genotype tissue-expression resource (TIGER), aggregating >500 human islet genomic datasets from five cohorts in the Horizon 2020 consortium T2DSystems. We impute genotypes using four reference panels and meta-analyze cohorts to improve the coverage of expression quantitative trait loci (eQTL) and develop a method to combine allele-specific expression across samples (cASE). We identify >1 million islet eQTLs, 53 of which colocalize with T2D signals. Among them, a low-frequency allele that reduces T2D risk by half increases CCND2 expression. We identify eight cASE colocalizations, among which we found a T2D-associated SLC30A8 variant. We make all data available through the TIGER portal (http://tiger.bsc.es), which represents a comprehensive human islet genomic data resource to elucidate how genetic variation affects islet function and translates into therapeutic insight and precision medicine for T2D.Peer reviewe

    Physiological Roles of ArcA, Crp, and EtrA and Their Interactive Control on Aerobic and Anaerobic Respiration in Shewanella oneidensis

    Get PDF
    In the genome of Shewanella oneidensis, genes encoding the global regulators ArcA, Crp, and EtrA have been identified. All these proteins deviate from their counterparts in E. coli significantly in terms of functionality and regulon. It is worth investigating the involvement and relationship of these global regulators in aerobic and anaerobic respiration in S. oneidensis. In this study, the impact of the transcriptional factors ArcA, Crp, and EtrA on aerobic and anaerobic respiration in S. oneidensis were assessed. While all these proteins appeared to be functional in vivo, the importance of individual proteins in these two major biological processes differed. The ArcA transcriptional factor was critical in aerobic respiration while the Crp protein was indispensible in anaerobic respiration. Using a newly developed reporter system, it was found that expression of arcA and etrA was not influenced by growth conditions but transcription of crp was induced by removal of oxygen. An analysis of the impact of each protein on transcription of the others revealed that Crp expression was independent of the other factors whereas ArcA repressed both etrA and its own transcription while EtrA also repressed arcA transcription. Transcriptional levels of arcA in the wild type, crp, and etrA strains under either aerobic or anaerobic conditions were further validated by quantitative immunoblotting with a polyclonal antibody against ArcA. This extensive survey demonstrated that all these three global regulators are functional in S. oneidensis. In addition, the reporter system constructed in this study will facilitate in vivo transcriptional analysis of targeted promoters

    Clusters of Conserved Beta Cell Marker Genes for Assessment of Beta Cell Phenotype

    Get PDF
    The aim of this study was to establish a gene expression blueprint of pancreatic beta cells conserved from rodents to humans and to evaluate its applicability to assess shifts in the beta cell differentiated state. Genome-wide mRNA expression profiles of isolated beta cells were compared to those of a large panel of other tissue and cell types, and transcripts with beta cell-abundant and -selective expression were identified. Iteration of this analysis in mouse, rat and human tissues generated a panel of conserved beta cell biomarkers. This panel was then used to compare isolated versus laser capture microdissected beta cells, monitor adaptations of the beta cell phenotype to fasting, and retrieve possible conserved transcriptional regulators.Journal ArticleSCOPUS: ar.jinfo:eu-repo/semantics/publishe

    Genome-Wide Identification of Transcription Start Sites, Promoters and Transcription Factor Binding Sites in E. coli

    Get PDF
    Despite almost 40 years of molecular genetics research in Escherichia coli a major fraction of its Transcription Start Sites (TSSs) are still unknown, limiting therefore our understanding of the regulatory circuits that control gene expression in this model organism. RegulonDB (http://regulondb.ccg.unam.mx/) is aimed at integrating the genetic regulatory network of E. coli K12 as an entirely bioinformatic project up till now. In this work, we extended its aims by generating experimental data at a genome scale on TSSs, promoters and regulatory regions. We implemented a modified 5′ RACE protocol and an unbiased High Throughput Pyrosequencing Strategy (HTPS) that allowed us to map more than 1700 TSSs with high precision. From this collection, about 230 corresponded to previously reported TSSs, which helped us to benchmark both our methodologies and the accuracy of the previous mapping experiments. The other ca 1500 TSSs mapped belong to about 1000 different genes, many of them with no assigned function. We identified promoter sequences and type of σ factors that control the expression of about 80% of these genes. As expected, the housekeeping σ70 was the most common type of promoter, followed by σ38. The majority of the putative TSSs were located between 20 to 40 nucleotides from the translational start site. Putative regulatory binding sites for transcription factors were detected upstream of many TSSs. For a few transcripts, riboswitches and small RNAs were found. Several genes also had additional TSSs within the coding region. Unexpectedly, the HTPS experiments revealed extensive antisense transcription, probably for regulatory functions. The new information in RegulonDB, now with more than 2400 experimentally determined TSSs, strengthens the accuracy of promoter prediction, operon structure, and regulatory networks and provides valuable new information that will facilitate the understanding from a global perspective the complex and intricate regulatory network that operates in E. coli

    Predicting DNA-Binding Specificities of Eukaryotic Transcription Factors

    Get PDF
    Today, annotated amino acid sequences of more and more transcription factors (TFs) are readily available. Quantitative information about their DNA-binding specificities, however, are hard to obtain. Position frequency matrices (PFMs), the most widely used models to represent binding specificities, are experimentally characterized only for a small fraction of all TFs. Even for some of the most intensively studied eukaryotic organisms (i.e., human, rat and mouse), roughly one-sixth of all proteins with annotated DNA-binding domain have been characterized experimentally. Here, we present a new method based on support vector regression for predicting quantitative DNA-binding specificities of TFs in different eukaryotic species. This approach estimates a quantitative measure for the PFM similarity of two proteins, based on various features derived from their protein sequences. The method is trained and tested on a dataset containing 1 239 TFs with known DNA-binding specificity, and used to predict specific DNA target motifs for 645 TFs with high accuracy
    corecore